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Abstract 



We give a test that can distinguish efficiently between product states of n quantum systems 
and states which are far from product. If apphed to a state ji/') whose maximum overlap with 
a product state is 1 — e, the test passes with probability 1 — &{e), regardless of n or the local 
dimensions of the individual systems. The test uses two copies of We prove correctness of 
this test as a special case of a more general result regarding stability of maximum output purity 
^ , of the depolarising channel. 

' A key application of the test is to quantum Merlin-Arthur games with multiple Merlins, 

^ , , where we obtain several structural results that had been previously conjectured, including the 

fact that soundness amplification is possible and that two Merlins can simulate many Merlins: 
QMA(fc) — QMA(2) for k > 2. Building on a previous result of Aaronson et al, this implies 
CO ' that there is an efficient quantum algorithm to verify 3-SAT with constant soundness, given two 

^ , unentangled proofs of 0{y/n) qubits. Among other consequences, this result implies complexity- 

theoretic obstructions to finding a polynomial-time algorithm to determine separability of mixed 
, quantum states, even up to constant error, and also to proving "weak" variants of the additivity 

' conjecture for quantum channels. 

Finally, our test can also be used to construct an efficient test for determining whether a 
' unitary operator is a tensor product, which is a generalisation of classical linearity testing. 

o ■ 

1 Introduction 

•rH 

X 

' Entanglement of quantum states presents both an opportunity and a difficulty for quantum comput- 

ing. To describe a pure state of n qudits (d-dimensional quantum systems) requires a comparable 
number of parameters to a classical probability distribution on items. Effective methods are 
known for testing properties of probability distributions. However, for quantum states many of 
these tools no longer work. For example, due to interference, the probability of a test passing 
cannot be simply written as an average over components of the state. Moreover, measuring one 
part of a state may induce entanglement between other parts of the state that were not previously 
entangled with each other. 

These counter-intuitive properties of entanglement account for many of the outstanding puzzles 
in quantum information. In quantum channel coding, the famous additivity violations of [16, 26] 
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reflect how entangled inputs can sometimes have advantages against even uncorrelated noise. For 
quantum interactive proofs, the primary difficulty is in bounding the ability of provers to cheat using 
entangled strategies [30]. Even for QMA(fc) (the variant of QMA with k unentangled Merlins [32, 2]), 
most important open questions could be resolved by finding a way to control entanglement within 
each proof. Here, the recently discovered failure of parallel repetition for entangled provers [31] is 
a sort of complexity-theoretic analogue of additivity violations. 

The situation is different when we consider quantum states that are product across the n systems. 
In this case, while individual systems of course behave quantumly, the lack of correlation between 
the systems means that classical tools such as Cher noff bounds can be used. For example, in channel 
coding with product-state inputs, not only does the single- letter Holevo formula give the capacity, so 
that there is no additivity problem, but so-called strong converse theorems are known, which prove 
that attempting to communicate at a rate above the capacity results in an exponentially decreasing 
probability of successfully transmitting a message [40, 47]. Naturally, many of the difficulties in 
dealing with entangled proofs and quantum parallel repetition would also go away if quantum states 
were constrained to be in product form. 

1.1 Our results 

In this paper, we present a quantum test to determine whether an n-partite state |^) is a product 
state or far from any product state. We make no assumptions about the local dimensions of in 
fact, the local dimension can even be different for different systems. The test passes with certainty 
if is product, and fails with probability 0(e) if the overlap between lip) and the closest product 
state is 1 — e. An essential feature of our test (or any possible such test, as we will argue in Appendix 
C) is that it requires two copies of {ip). 

The parameters of our test resemble classical property-testing algorithms [18]. In general, these 
algorithms make a small number of queries to some object and accept with high probability if the 
object has some property P (completeness), and with low probability if the object is "far" from 
having property P {soundness). Crucially, the number of queries used and the success probability 
should not depend on the size of the object. The main result of this paper is a test for a property of a 
quantum state, in contrast to previous work on quantum generalisations of property testing, which 
has considered quantum algorithms for testing properties of classical (e.g. [13, 5]) and quantum [38] 
oracles (a.k.a. unitary operators, although see Section 5 for an application to this setting). In this 
sense, our work is closer to a body of research on determining properties of quantum states directly, 
without performing full tomography (e.g. the "pretty good tomography" of Aaronson [1]). The 
direct detection of quantities relating to entanglement has received particular attention; see [24] 
for an extensive review. However, previous work has generally focused on Bell inequalities and 
entanglement witnesses, which are typically designed to distinguish a particular entangled state 
from any separable state. By contrast, our product test is generic and will detect entanglement in 
any entangled state l'^). 

The product test is defined in Protocol 1 below, and illustrated schematically in Figure 1. It 
uses as a subroutine the swap test for comparing quantum states [12]. This test, which can be 
implemented efficiently, takes two (possibly mixed) states p, a of equal dimension as input, and 
returns "same" with probability ^ + ^tr pa, otherwise returning "different". 

The product test has appeared before in the literature. It was originally introduced in [37] as 

one of a family of tests for generalisations of the concurrence entanglement measure, and has been 
implemented experimentally as a means of detecting bipartite entanglement directly [45]. Further, 
the test was proposed in [38] as a means of determining whether a unitary operator is product. 



2 



Protocol 1 (Product test). 

The product test proceeds as follows. 

1. Prepare two copies of G C^^ (8) • • • (8) C"'"; call these lipi), \'4>2)- 

2. Perform the swap test on each of the n pairs of corresponding subsystems 
of \tpi), \tp2)- 

3. If all of the tests returned "same", accept. Otherwise, reject. 



Our contribution here is to prove the correctness of this test for all n, as formalised in the following 
theorem. 

Theorem 1. Given e C^i • • • (g) C''", let 

1 - e = max{\{'ilj\4>u . . . , 0n)|' : \(l>i) G C*, 1 < z < n}. 

Let PtestiW ii^l) be the probability that the product test passes when applied to \^). Then 

1 - 2e + < Pt,,t{\^){^\) < 1 - e + e2 + g3/2_ 

PuHhermore, ife> 11/32 > 0.343, PtesM {'^\) < 501/512 < 0.979. 
More concisely, Ptestilip) {tp]) = 1 - ©(e). 

This result is essentially best possible, in a number of ways. First, we show in Appendix C that 
the product test itself is optimal: among all tests for product states that use two copies and have 
perfect completeness, the product test has optimal soundness. We also show that there cannot exist 
any non-trivial test that uses only one copy of the test state. Second, our analysis of the test cannot 
be improved too much, without introducing dependence on n and the local dimensions. When e 
is low, we give examples of states \tj:) which achieve the upper and lower bounds on Ptest(|V')(V'l)) 
up to leading order. We also give an example of a bipartite state for which e is close to 1, but 
^tcst(|V')(V'l) ~ 1/2, implying that the constant in our bound cannot be replaced with a function 
of e that goes to as e approaches 1. (The bounds on this constant obtained from our proof could 
easily be improved somewhat, but we have not attempted to do this.) See Appendix B for all these 
examples. Finally, it is unlikely that a similar test could be developed for separability of mixed 
states, as the separability problem for mixed states has been shown to be NP-hard [25, 19] (and 
indeed we improve on this result, as discussed below). 

The proof of Theorem 1 is based on relating the probability of the test passing to the action of 
the qudit depolarising channel. In fact, we prove a considerably more general result regarding this 
channel. It is known that the maximum output purity of this channel is achieved for product state 
inputs [4]; our result, informally, says that any state that is "close" to achieving maximum output 
purity must in fact be "close" to a product state. This is a stability result for this channel, which 
strengthens the previously known multiplicativity result. 

Somewhat more formally, let Vs be the d-dimensional qudit depolarising channel with noise 
rate 1 — S, i.e. 

Vs{p) = {l-S){tvp)^ + 5p (1) 
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Figure 1: Schematic of the product test apphed to an n-partite state l'^). The swap test (vertical 
boxes) is apphed to the n pairs of corresponding subsystems of two copies of 1-0) (horizontal boxes). 



for p a arbitrary mixed state of one d-dimensional system, and define the product state out- 
put purity to be Pprod('^) = tr(I>®"' |0){0|)^, where 10) is an arbitrary product state. Then 

our main result, stated more precisely as Theorem 14 in Appendix A, is that for small enough 
5 > 0, if tr(Pf" |?/;)('0|)2 > (1 - e)Pprod((^), then there is a product state |0i,...,0n) such that 
m<\>U...An)?>\-0{e). 

1.2 Applications and interpretations of the product test 

We describe several applications of the product test. The most important of these is that this 
test can be used to relate QMA(A;) to QMA(2), as we will discuss in Section 3. The complexity 
class QMA(/c) is defined to be the class of languages that can be decided with bounded error by 
a poly-time quantum verifier that receives poly-size witnesses from k unentangled provers^ [32, 2]. 
To put QMA(/c) inside QMA(2) with constant loss of soundness, we can have two provers simulate k 
provers by each submitting k unentangled proofs, whose lack of entanglement can be verified with 
our product test. Indeed, this gives an alternate way to understand our test as a method of using 
bipartite separability to certify /c-partite separability. 

Surprisingly, using this result as a building block also allows us to prove amplification for 
QMA(A;) protocols. It has been conjectured [32, 2] that such protocols can be amplified to expo- 
nentially small soundness error. We completely resolve this conjecture, showing that QMA(A;) pro- 
tocols can be simulated in QMA(2) with exponentially small soundness error, and hence QMA(A;) = 
QMA(2) for k>2. 

As a further corollary, we can improve upon the results of [2, 9] to obtain a protocol in QMA(2) 
that verifies 3-SAT with constant soundness gap and 0(A/npoly log(n)) qubits (where n is the 
number of clauses). This in turn has consequences for the difficulty of approximating SEP(d, d), 
the set of separable quantum states on dx d dimensions. It was shown in Ref. [25] that SEP cannot 
be approximated to precision exp(— d) in time poly(ci) unless P = NP. In Refs. [35, 19], this result 
was improved to show that approximating SEP to precision 1/ poly(d) is similarly NP-hard. We 
show that there is a universal constant 5 > such that, if if is a convex set that approximates SEP 
to within trace distance 5, then membership in K cannot be decided in polynomial time unless 
3-SAT G DTIME(exp(^log°(^)(n))). 

Our result has two further corollaries, under the same assumption on the complexity of 3-SAT. 
First, we show that the minimum output entropy of a quantum channel cannot be estimated up to 
a constant in polynomial time. Second, wc show a hardness result for approximating the ground- 
state energy of quantum systems under a mean-field approximation. Our proof that amplification 

^We assume throughout this paper that k is at most polynomial in the input size. 
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of QMA(2) protocols is possible implies that one can derive stronger hardness results for these tasks, 
if one is willing to make stronger assumptions about the hardness of 3-SAT. 

Our final application is that the product test can be used to determine whether a unitary 
operator is a tensor product. This can be seen [38] as one possible generalisation of the well-studied 
problem of testing whether a boolean function {0, 1}" {0, 1} is linear [10]. This application is 
described in Section 5. 

These different applications of the product test reflect the many different interpretations of 
-Ftest(|V')(V'l)- It is related in a precise sense to 

• The purity of jV') after it is subjected to independent depolarising noise (see Appendix A). 

• The maximum overlap of \^p) with any product state (proved in Appendix B). The logarithm 
of this maximum overlap is an important entanglement measure known as the geometric 
measure of entanglement (see [46] and references therein). 

• The overlap of | with the tensor product of the symmetric subspaces of C^^^C^^ , . . . , C*^"® 
C^" (discussed in Appendix C). 

• The average overlap of with a random product state, and a quantum variant of the Gowers 
uniformity norm [20] (discussed in Appendix G). 

• The average purity of j^) across a random partition of [n] into two subsets (also discussed in 
Appendix G). 

2 Overview of the proof of correctness 

A full proof of correctness of the product test (Theorem 1) is given in Appendices A and B. In this 
section, we sketch the proof strategy. 

Our results are based on the following expression for the probability that an arbitrary state 
passes the product test: Ptcst(|V')('0l) = 2^ Ssc[n] ^'^'^'s- "^^^ connection with the depolarising 
channel follows from observing that the output purity of a state to which n copies of the depolarising 
channel with arbitrary noise rate have been applied is given by a similar but more complicated sum, 
with weights on each term. 

The proof itself is split into two parts, beginning with the case where e is low. The difficult part 
is the upper bound on Ptcst(|V')(V'l)- We write = ^Jl — ejO"") + \/e|0) without loss of generality, 
for some product state |0") and arbitrary state This allows an explicit expression for iiil^g in 
terms of e and |0) to be obtained. Each term of this expression is then carefully upper bounded 
to give an upper bound in terms of a sum of the amplitudes of 10), with weights that decrease 
exponentially with the Hamming weight of basis states. In order to obtain a non-trivial bound 
from this expression, the final stage of this part of the proof is to use the fact that |0") is the closest 
product state to j-^) to argue that \(p) cannot have any amplitude on basis states of Hamming 
weight 1. 

In the case where e is high (a regime essential to the application to QMA(fc)), this result does 
not yet give a useful upper bound. In the second part of the proof, we derive a constant bound on 
-ftest(|V')(V'|) based on considering as a fc-partite state, for some k < n. Ptest(|V')(V'|) can be 
shown to be upper bounded by the probability that the test for being product across any partition 
into k parties passes. Informally speaking, if j-^) is far from product across the n subsystems, we 
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show that one can find a partition such that the distance from the closest product state (with 
respect to this partition) falls into the regime where the first part of the proof works. 

This completes the overview of the proof; we now turn to applications of the product test. 

3 QMA(2) vs. QMA(A;) 

In this section, we apply the product test to a problem in quantum complexity theory: whether 
k uncntangicd provcrs arc better than 2 uncntanglcd provers. This question can be formalised as 
whether the complexity classes QMA{k) and QMA(2) are equal [32, 2]. These classes are defined 
as follows. 

Definition 1. A language L is in QMA(A;)s^c if there exists a polynomial-time quantum algorithm 
A such that, for all inputs x G {0, 1}"; 

1. Completeness: If x e L, there exist k witnesses . . . , each a state of poly {n) 
qubits, such that A outputs "accept" with probability at least c on input . . . \ipk)- 

2. Soundness: Ifx ^ L, then A outputs "accept" with probability at most s on input \x) . . . IV'fe); 
for all states ■ ■ ■ , \ipk)- 

We use QM/\{k) as shorthand for QMA(fc)i/3^2/3. and QMA as shorthand for QMA(l). We 
always assume 1 < k < poly(n). 

We also define QMA^(A;)s^c to indicate that ■ ■ ■ , \i/^k) each involve m qubits, where m may 
be a function of n other than poly(n). 

Two of the major open problems related to QMk{k)s,c are to determine how the size of the 
complexity class depends on k and on s, c. It has been conjectured for some years [32, 2] that in fact 
QMA(/c) = QMA(2) for 2 < k < poly(n), and that the soundness and completeness can be amplified 
by parallel repetition in a way similar to BPP, BQP, MA, QMA and other complexity classes with 
bounded error. In fact, these conjectures are related: 2k independent provers can simulate k 
independent realisations of a QMA(2) protocol in order to amplify the soundness-completeness gap, 
and conversely, [32, 2] proved that QMA(2) amplification implies that QMA(2) = QMA(poly). In 
this section, we will fully resolve these conjectures, proving that QMA(2) = QMA(poly) and that 
QMA(A;) can have its soundness and completeness amplified by a suitable protocol. 

The most direct way of putting QMA(A;) inside QMA(2) is to ask two provers to each send the 
k unentangled proofs that correspond to a QMA(A;) protocol. If k = poly(n), then each prover is 
still sending only polynomially many qubits. Then the product test can be used to verify that the 
states sent were indeed product states and can be used as valid inputs to a QMA(fe) protocol. The 
specific protocol is described in Protocol 2. 

First observe that for YES instances (instances in the language), the two Merlins can achieve at 

least the same success probability that k Merlins can in the original protocol, so the completeness 
can only increase. Now consider NO instances. Assume for now that the two Merlins always send 
the same state. Then according to Theorem 1, if the Merlins send states that are far from product, 
they are likely to fail the product test, whereas basic continuity arguments can show that if they 
send states that are nearly product then the success probability will not be much larger than the 
soundness of the original protocol. Thus, the soundness does not become too much worse. These 
ideas (with a detailed proof in Appendix D) establish 
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Protocol 2 (QMA(A;) to QMA(2)). 

The QMA(2) protocol proceeds as follows. 

1. Each of the two Merlins sends := IV'i) (8) ... (8) 1^"^) to Arthur. 

2. Arthur runs the product test with the two states as input. 

3. If the test fails, Arthur rejects. Otherwise, Arthur runs the algorithm A 
on the state sent by the first Merlin, and outputs the result. 



Lemma 2. For any m, k, < s < c < 1, 

QMAm{k)s,c C QMAjk^(2),,,, 

where s' = 1 - n{{l - s)^). 

This is already strong enough to achieve amphfication up to constant soundness. However, 
Protocol 2 has a salutary side effect that will allow us to achieve stronger amplification. To see 
this, we will first introduce a further generalisation of the QMA(A;) family. Let M be a set of 
Hermitian operators AI satisfying < M < I . Each M G M defines a binary measurement with M 
corresponding to the "accept" outcome and I — M corresponding to the "reject" outcome. Then 
define QMA^{k)s,c to be the class QMAfn{k)s,c with Arthur restricted to performing measurements 
from M. For example, if M = PROD is the set of product measurements followed by post-processing, 
then QMA'^^°°(A;) is the class BcUQMA(/c) that was proved equal to QMA (for constant k) by 
Brandao [11, 2]. We will be interested in taking M = SEP. Note that for such measurements, M 
is a separable operator, but / — M is not necessarily separable. 

Armed with the definition of QMA^^^, we can now see that Protocol 2 produces a protocol that 
is not only in QMA(2), but also QMA^^^(2). More formally, we can strengthen Lemma 2 to: 

Lemma 3. For any m, k, < s < c < 1, 

QMA^{k)s,c^QMAl^{2),,,, 

where s' = 1 - - s)^). 

Proof. Suppose the first Merlin sends systems Ai,...,Ak and the second Merlin sends systems 
Bi,. . . ,Bk. The "accept" outcome of the product test corresponds to the tensor product of pro- 
jectors onto the symmetric subspaces of AiBi, A2B2, ■ ■ ■ , Ai^Bi^. These are all separable across the 
A:B cut, and so their tensor product is as well. The second step is to simply apply a measure- 
ment entirely on Ai, . . . , A^, which is automatically separable. Finally, performing two separable 
measurements in a row creates a composite measurement which is also separable. □ 

The advantage of QMA^^^(A;) is that it removes the chief difficulty with QMA(fe) amplification, 
which is that conditioning on measurement outcomes can induce entanglement between systems 
we have not yet measured. This phenomenon is known as entanglement swapping. However, if we 

condition on the outcome of a measurement being M, for some Af G SEP, then no entanglement 
will be produced in the unmeasured states. As a result, cheating provers cannot gain any advantage 
by sending entangled proofs, and we obtain the following lemma. 
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Lemma 4. For any £>1, 

QMAfP(A;),,eCQMAf^P(fc),,,,,. 

The idea is to simply repeat the original protocol £ times in parallel and to accept iff each 
subprotocol accepts. Since we are considering QMA^^^ protocols, obtaining a YES outcome on 
one proof will not induce any entanglement on the remaining proofs. We give a detailed proof of 
Lemma 4 in Appendix D. 

Prom Lemma 3 and Lemma 4, we can almost conclude that strong amplification is possible. 
Indeed, when we start with protocols with perfect completeness, we can apply Protocol 2, repeat 
p{n) times, and reduce the soundness from s to For the case of c < 1, we need one 

additional argument to keep the completeness from being reduced too much at the same time. 
Here we will use a method for completeness amplification proved in both [32, Lemma 5] and [2, 
Lemma 6]. 

Lemma 5 ([32, 2]). For any £>1, 

Our amplification procedure for general c < 1 is then to 

1. Use Lemma 5 to bring the completeness exponentially close to 1. 

2. Use Lemma 3 to convert a general QMA(A;) protocol to a QMA^^^(2) protocol. 

3. Repeat the protocol polynomially many times to make the soundness exponentially small. 

This procedure then achieves 

Theorem 6. 1. If s < 1 — l/poly(n), k = poly(n) and p(n) is an arbitrary polynomial, then 
QMA(fc),,i = QMASEP(2),,p(_p(„))^i. 

2. If c — s > 1/ poly(n), k = poly(n) and p{n) is an arbitrary polynomial, then QMA{k)s,c = 

QMA^^^(2)exp(-p(n)),l-exp(-p(n))- 

We prove correctness of Protocol 2 and the rest of Theorem 6 in Appendix D. There are obvious 
variants of Theorem 6 to cover the case of limited message size, whose statements we leave implicit. 

4 Complexity-theoretic implications 

A key application of Theorem 6 is to the protocol of Ref. [2] that puts 3-SAT on n clauses inside the 
complexity class QMAiog(y^poly log(n))i_n(i),i. Applying Theorem 6 lets us simulate this using 
two provers with perfect completeness and arbitrary soundness, so that we obtain 

Corollary 7. Let £ -.N N be polynomially bounded. Then there is a universal constant < s < 1 
such that 

3-SAT e QMA^(„)^poiyiog(„)(2)^^(„) 

In other words, there is a 3-SAT protocol with two provers, £{n) '^/n poly log{n)-qubit proofs, perfect 
completeness and soundness s^^'^h 
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Therefore, making assumptions about the hardness of 3-SAT allows us to prove hardness results 
for the complexity class QMAiog(2), and stronger assumptions naturally imply stronger hardness 
results. We formalise this correspondence as the following corollary. 

Corollary 8. Assume that, for some function £ : N — )• N such that £{n) = o{n), 3-SAT ^ 
DTIME(exp(0(£(n)))). Then, defining d = 2^(")V^Poiyi°gW, 

QMAi„g(,)(2)^,(„) ^ ^ DTIME(poly(d)) 

for some universal constant < s < 1. In particular, taking i{n) = 1, we have 

QMAiog(d)(2).,i^DTIME(poly(d)) 

under the assumption that 3-SAT ^ DTIME(exp(-v/nlog*^(^)(n))). 

Note that the (not implausible) Exponential Time Hypothesis of Impagliazzo and Paturi [29] 
states that 3-SAT ^ DTIME(exp(^(n))) for any £{n) = o{n). We conclude this section by discussing 
three natural QMAiog(2)-complete problems whose hardness is implied by this hypothesis. For 
simplicity, we focus on the weakest possible hardness assumption in Corollary 8, but it should be 
easy to sec how to strengthen this assumption to obtain stronger results. 

First observe that the acceptance probability in a QMA^(2) protocol can be expressed as 
™8^peSEP(2™,2™) tr M/3, where < M < / is the measurement resulting from the verifier's quantum 
circuit and SEF{dA, ds) denotes the set of separable density matrices on d^ x ds dimensions. In 
other words, QMA^(2) is equivalent to optimising a linear objective function over the convex set 
SEP(2™,2™). 

In some cases, it may be useful to obtain an explicit description of M. This can be achieved 
up to error e by running the verifier's circuit poly(2"*, 1/e) times and performing tomography. 
As a result, we trivially obtain that QMA„j(2)s^c ^ NTII\/IE(poly(2™, n, l/(c — s))). In particular, 
QMA(2) C NEXP. Unfortunately this cannot be scaled down to place QMAiog(2) in NP. This is 
because the verifier in a QMAiog(2) protocol still can perform a poly-time quantum computation. 
Thus, we only have that QMAiog(2) C NP^^''. 

Application 1: Separability-testing. A folk theorem of convex optimisation [22] states that the 
problem of optimising a linear function over a convex set, such as SEP, is equivalent to determining 
membership in that set. Thus, we should be able to relate QMAiog(2) to the problem of determining 
membership in SEP. To make this precise, for any convex if C we define B{K,e) to be 

: 3y G K, \\x — y\\ < e} when e > and {x : $y ^ K, \\x — y\\ < — e} when e < 0. The 
weak membership problem for K, WMFiM^lK), is to determine whether a point x belongs to 
B{K,eY or B[K,—e) given the promise that one of these is the case. The weak optimisation 
problem for K, WOPT£(A') is to maximize a linear objective function over any set L satisfying 
B{K,—e) C L C B{K,e). Given some mild conditions on K, we can reduce WOPT£(Ar) to 
WMEM£/poiy(^-)(A') in polynomial time [22]. This fact has been used to show the NP-hardness 
of WMEMi/poiy(SEP) in Refs. [35, 19, 7] and, previously, of WMEMi/exp(SEP) by Gurvits [25] 
(although the connection to QMAiog(2) was only observed by [7]). 

Unfortunately, many of these techniques break down in the setting of constant error. We believe 
that it should not be possible to approximate SEP((i, d) to within a (sufficiently small) constant 
accuracy in time poly{d). However, we are able to rule out only algorithms that have the further 
restriction of recognizing a nearly convex set that in turn approximates SEP to constant accuracy. 



9 



Corollary 9. Let K be a convex subset of the space of d? x d? Hermitian matrices such that K C 
i3(SEP((i, d), 5) and SEP(d, d) C B{K, 6), where B{-, ■) is defined relative to the trace norm and 6 > 
is a universal constant. Then, assuming 
cannot be decided in time poly(o?). 

(As with the other hardness results in this section, the precise value of 6 is determined by the 
protocol in Ref. [2].) 

Proof. Solving WMEMx/poiy(^-)(i(') in polynomial time would allow us to solve WOPTi/poiy((^)(ii') 
in poly time, which in turn would give a poly-time algorithm for WOPT^_|_iypQiy^(^) (SEP). This last 
claim, together with Corollary 8, would contradict the hypothesis on the complexity of 3-SAT. □ 

Application 2: Minimum output entropy of quantum channels. Our results also have implica- 
tions for quantum information theory. Let M denote a quantum channel with d-dimensional input 
and output. Define the minimum output Renyi a-entropy of M to be <S™°(A/') = minp Sa{M{p)), 
where Sa (c) = log tr a" and the minimum is taken over all quantum states p. Note that 
S^^^'^iAT) is also equivalent to j^log||AA|| i^a, where || • (also called in e.g. [4]) is the 

£i — )■ £p norm. When a = 0, l,oo, we define Sa{cr) by letting a approach these values, obtaining 
So{a) = log rank (7, Si{a) = — trcr log cr and 5'oo(o") = — log ||(t||oo- Additivity of S'f™(A^), the 
minimum output entropy, is intimately connected to additivity of the Holevo capacity [42, 8]. 

It was observed by Matsumoto [36] (citing a personal communication from Watrous) that the 
maximum acceptance probability of a QMAm(2) protocol is precisely ||A/'||i^oo for some quantum 
channel J\f acting on d = 2™ dimensions. For completeness, we give a proof of this in Appendix E. 
This implies that determining whether S™^{M) is > log(l/s) or < log(l/c) is a complete problem 
for QMAiog(^)(2)s^c under BQP reductions. 

The QMA(2)-completeness of estimating .S™"^ implies that other information-theoretic quanti- 
ties that are close to S*™™ are also hard to approximate. For example, for any a > 0, we have 

"'(AA) > S^"'{U) but also 5°"'^(AA) = iff ^^"(AA) = 0. Thus, our hardness result for approxi- 
mating S™"^ immediately translates to a hardness result for approximating 5™°. 

Corollary 10. There exists a universal constant 5 > such that for any a > 0, if 3-SAT 
DTIME(exp(-v/nlog'^^^^(n))) then it is impossible to determine whether S'™™(A/") = or S'™°(A/") > 
S in worst-case time poly(d). 

Beigi and Shor previously showed that it is NP-hard to compute the minimum output entropy 
up to 1/ poly((i) accuracy [8]. Our result improves theirs, but under a more restrictive complexity 
assumption. Another major goal in information theory is to estimate the regularised minimum 
output entropies of quantum channels, which are defined to be 

SR,^^(^f) := liminf -Sf^'iM^"). 

The S'q '"""^^(A/") arc relevant to determining the ultimate channel capacity, to proving strong converse 
theorems [33] and to cryptographic protocols [34] . Our hardness result for S*™"^ immediately gives us 
the equivalent hardness result for Sa'™'"- The reason is that our proof of amplification for QMA(2) 
protocols (see Lemma 4) essentially works by constructing a channel J\f for which S^^^'^{M) = 
S^'^iAT) by design. 

For general channels, we automatically have 6*0 '™'"(AA) < S'™™(A/'); however, the famous failures 
of the additivity conjecture imply that sometimes this inequality can be strict, with examples known 
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for a > 1 [26, 27] and for (X near [14]. Still, these examples only demonstrate that 5*^' can 
deviate very slightly from 5"™'°. On the other hand, various lower bounds for S*,^'™'" are known 
[44, 48, 15, 49], and it may be that one of these bounds could be related to 5™°, thereby proving 
that /Sq'™'" cannot be fax from -S™'". Our results do not rule out the possibility that iS™™ may 
be fruitfully related to iSq'™™. However, they do imply that these lower bounds on ^a'™'" (and 
thereby on S*™™) are unlikely to be efficiently computable, or if they are, they are likely to be 
extremely loose bounds in general. 

Application 3: mean- field approximation. Finally, we discuss an application from condensed- 
matter physics. Consider a system of n c/-dimensional quantum systems arranged in a lattice with 
identical nearest-neighbour pairwise interactions. The mean-field approximation replaces the true 
nearest-neighbour graph with the complete graph. When the number of spatial dimensions is 3 
(or more), this is often a reasonable approximation, li K \s & fixed two-qudit Hamiltonian and 
Ki^j denotes the action of K on systems j and the identity on the other systems, then the total 
Hamiltonian '\s H = ^i^j Kij. To set the overall scale of the problem, assume that < K < I. 
One of the more important physical questions about H is to determine its ground-state energy; 
that is, its smallest eigenvalue. 

In Ref. [17], the quantum de Finetti theorem was used to show that when n ^ (f, then the 
ground state of H is very close to a product state. In this case, finding the ground-state energy of 
H is equivalent to minimising tv pK over all p G SEP((i, d). Again applying Corollary 8, we obtain: 

Corollary 11. Assuming 3-SAT ^ DTIME(exp(y^log*^^^^(n))) and with H defined as above, it is 

impossible to estimate mm{tT pK : p € SEP((i, d)} in time poly{d) to within o(l) error. Equiva- 
lently, it is impossible to estimate the ground-state energy of H to within additive error o(n^). 

Previous work on the hardness of approximating ground-state energy of quantum systems gen- 
erally had d constant and only ruled out the possibility of l/poly(n) approximation error. In 
terms of approximation errors, our result achieves one of the goals of the conjectured quantum 
PCP theorem [3] . However, we require d to grow asymptotically, and we achieve a hardness result 
much weaker than QMA-hardness. Indeed, due to the classical PCP theorem combined with the 
Exponential Time Hypothesis, finding the ground state of a system of log{d) bits (without any 
symmetry constraint) is likely to require time exp((i^ log(d)), while our results merely imply an 
r2(exp(log^((i))) lower bound. Still, our result provides a superpolynomial bound on an important 
class of Hamiltonians that had been previously considered to be computationally easy to work with. 

5 Testing for product unitaries 

As well as being useful for testing quantum states, the product test has applications to testing 
properties of unitary operators. The results we obtain will be in terms of the normalised Hilbert- 
Schmidt inner product, which is defined as {M,N) := ^tiM^N for M,N e M{d), where M{d) 
denotes the set of d x d matrices. Note that, with this normalisation, |(?7, F)| < 1 for unitary 
operators U,V. The following correspondence (also known as the Choi-Jamiolkowski isomorphism), 
underlies our ability to apply the product test to unitaries. 

Let 1$) be a maximally entangled state of two d-dimensional qudits, written as \i,i) 
in terms of some basis B = (|1), . . . , |d)). For any matrix M G M{d^), define \v{M)) := (M (g) 
7)1$)®". Then {j\{k\v{M)) = -^^J^- In particular, for any matrices M,N e M{(r'), {M,N) = 
{v{M)\v{N)) =trMtiV/d". 
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We consider the problem of testing whether a unitary operator is a tensor product. That is, 
we are given access to a unitary U on the space of n qudits (for simplicity, restricting to the 
case where each of the qudits has the same dimension d), and we would like to decide whether 
U = U\® ■ ■ ■ ®Un- This is one possible generalisation of the classical problem of testing linearity 
of functions / : {0,1}" {0,1} [10]; the classical special case is obtained by restricting U to be 
diagonal in the computational basis and to have diagonal entries all equal to ±1. 

In Protocol 3 we give a test that solves this problem using the product test. The test always 
accepts product unitaries, and rejects unitaries that are far from product, as measured by the 
normalised Hilbert-Schmidt inner product. 

Protocol 3 (Product unitary test). 



The product unitary test proceeds as follows. 

1. Prepare two copies of the state |$)®", then in both cases apply U to the n 
first halves of each pair of qudits to create two copies of the state \v{U)) G 

2. Return the result of applying the product test to the two copies of \v{U)), 
with respect to the partition into n d"^ -dimensional subsystems. 



Let the probability that this test passes when applied to some unitary U be Ptest(t^)- Then we 
have the following theorem, which proves a conjecture from [38]. 

Theorem 12. Given U G UicT"), let 

l-e = max{[(i7,Vi®---®F„)[2 : ^1,...,^ G U{d)}. 

Then, if€ = 0, PtestiU) = I. If e < 0.106, then Ptest{U) <l-\e+^e'' + ie^/s. // 0.106 < e < 1, 
Ptest{U) < 501/512. More concisely, Ptest{U) = 1 - e(e). 

The proof is given in Appendix F. It is not quite immediate from the previous results; the 
key problem is that the closest product state to \v{U)) may not correspond to the closest unitary 
operator to U. 

Our test is sensitive to the Hilbert-Schmidt distance of a unitary from the set of product 
unitaries. One might hope to design a similar test that instead uses a notion of distance based 
on the operator norm. However, this is not possible. For example, if we could detect a constant 
difference in the operator norm between an arbitrary unitary U and the set of product unitaries 
then we could find a single marked item in a set of size d". By the optimality of Grover's algorithm, 
this requires fi(d"/^) queries to U. More generally, any test that uses only a constant number of 
black-box queries to U can only detect an difference in an 0(1) fraction of the dimensions 
that U acts upon. 



6 Conclusion 

Our main result can be seen as a "stability" theorem for the output purity of the depolarising 
channel. It is an interesting problem to determine whether a similar result holds for all output 
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Renyi entropies for the depolarising channel, or even for all channels where additivity holds. As 
a more modest open question, can Theorem 1 be tightened further, perhaps by improving the 
constant in the e^/^ term? It would also be interesting to improve the constants in Theorem 1 
in the regime of large e, as at present they are extremely pessimistic. The regime of large e is 
generally somewhat mysterious: for example, we do not know the minimum value of Ptestj or the 
largest distance from any product state that can be achieved by a state of n qudits. 
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A The depolarising channel 

Let Vs be the qudit depolarising channel as defined in equation (1). We will be interested in 
applying the n-fold product P®" to states of n qudits, and in particular in the purity of the 
resulting states. This has the following characterisation. 

Lemma 13. We have 



and in particular 



i/vd+i'^^ (d + iy 

SC[n] 



and for pure product states, 



Proof. Consider some Hermitian operator basis for B{C'^) which contains the identity and is or- 
thonormal with respect to the normalised Hilbert- Schmidt inner product {A,B) = ^^rA^B, and 
extend this basis to i3((C'^)®"^) by tensoring. Expand p in terms of the resulting basis as 

te{o,...,(i2-i}" 

where pt G M, xt represents an element of the tensor product basis corresponding to the string 
t G {0, . . . , — 1}", and the identity is indexed by at each position. Then we have 



tr(p|) = d^-l^l I pI 
kt,ti=0,Vie5 
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and hence, for any 6, 



SQln] 5C[n] \t,ti=0,VieS / t 



SC[n], 

\ti=o,^ies J 



/n-\t\ , 

= ''"'E''? E ( 

t y a;=0 ^ 

= d2"^p2(5/ci)|t|(l + 5/d)""l*l 
t 

= M<! + '5))"E''tW(* + <'))l'l 

Rearranging completes the proof; the two special cases in the statement of the lemma can be verified 
directly. □ 

Using the above lemma, we can see that maximal output purity is obtained only for product 
states, since only product states saturate the inequality tr pi < 1 for all 5 C [n]. We will now prove 
our main result, which is a "stability" theorem for the depolarising channel: if a state achieves close 
to maximal output purity, it must be close to a product state. 

Theorem 14. Given G (C'^)^'", let 

1 - e = max{|(^|,^i, . . . , </.„)p : |</.i), . . . , |</.„) G C^}. (2) 

Then 

.ivf' imif < (i - 4.(1 - + 4.3/^ da^^i' , 

In particular, 

*^(^f/"x/d+T ^ PpUl/VdTl) (l - e + 6^ + 6^/2) . 

Proof. Without loss of generality assume that one of the states achieving the maximum in Eq. (2) 
is |0)®", which we will abbreviate simply as |0"), or |0) when there is no ambiguity. We thus have 

|V) = vT^|o) + Vi|</') 

for some state \(f)) such that (0|^) = 0, and \(f)) = J2x^o'^x\^) ^'^^ some {a^}. We write down 
explicitly 

^|; := = (1 - e)|0)(0| + ,/7{r^m{cP\ + |</))(0|) + e[ 

By Lemma 13, 

i:2\ " 



2N 



^ ^ SC[n] 
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where we set 7 = dS'^/{l — 5^) for brevity. Now 

J2 7l^ltrV'|= Yl 7l^l(tr((l-e)|0)(0|5 + V^(r^(|0)(<^|5 + |<^)(0|s) + e|<^)(<A|5)'), 

SC[n] SC[n] 

and for any subset S, 

tr^l = (l-e)2tr|O)(O||+6(l-6)tr(|O)(,^| + |</>)(O|)|+62tr|0)(0|| 

+ 2^(1 - efHi |0)(0|5(|0)(,^| + \cP){0\)s + 2e(l - e) tr |O)(O|5|0)(</.|5 
+ 2e3/2^r3^tr|</.)(</.|5(|O)(0| + |<^)(O|)5. 

We now bound the sum over S (weighted by 7!'^!) of each of these terms, in order. Note that we 
repeatedly use the notation [E] for a term which evaluates to 1 if the expression E is true, and 
if E is false. 

1. As 1 0) is product, clearly 

^7l"ltr|0)(0||= 5;7l"l = (l + 7r. 

SC[n] SC[n] 

2. We have 

tr(|O)(0| + |0)(O|)| = tr |0)((A|| + tr |0)(O|| + 2 tr \0){^\s\^){0\s. 

It is easy to see that the first two terms must be for all S (as only the off-diagonal entries 
of the first row of the matrix \0){(t>\ can be non-zero). For the third, we explicitly calculate 

|0)(<^|5|</')(0|5 = Yl = o,yie S]\0){0\^\ 

and hence 

J] 7l^ltr|0)(,^|5|^)(0|5 = 5^|a.p 5^ 7l^l[^^ = 0,V^G-S] 

SC[n] x^Q SC[n] 




3. It clearly holds that tr 10) (011 < 1, so as in part (1), 

Y 7l''ltr|0)(0||<(l+7r, 
SC[n] 

and this will be tight if and only if |0) is product itself. 

4. Using the same argument as in part (2), tr |O)(O|5|O)(0|5 = tr |O)(O|5|0)(O|5 = 0. 
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5. Write the state (j) = ^ 

= XI (t^xi,...,yn\xi){yi\ \Xn){yn\ 



Then, for any S = {ii, . . . ,ik}, 

(t>s = Y^[xi = yiyi e (8) • • • (g) \xi^){yik\, 



which imphes 

tr |O)(O|5|0)(0|5 = ^[xi = 0, Vi G S]\a^\\ 

X 

and hence, similarly to part (2), 

X7i"itr|o)(o|,i^>(^i,= X7^(^" J ) = (i+7rEKi'(rT^) 



SC[n] k=0 ^ ' x^O 

6. The last term can be trivially bounded using 

|tr|,^)(,^|5(|0)(</.| + |,^)(0|)5|<2. 
However, it is possible to get a better bound with a bit more work. We expand 

7l^ltr|</>)(</>|5|O)(0|5 = 

5C[n] 

X 7''^' XI = 0,i G S\\xi = yi,ie 5] tr |a;i)(yi|0)(2;i| |a;„)(y„|0)(2;„| 

5C[n] x,y,z 

= X ^x'4<i^i = 0,z G S][xi = yi,i G S][yi = 0,i G S][xi = Zi,i G S] 

SC[n] x,y,z 

= J2 "yv^a^a* X t'^' = 0, i G S] [zi = 0, z G 5] 

\yAz\=0 SC[n] 

\yAz\=o 

This expression can be upper bounded as follows: 

Q!j/V2a*a*7 



E 

|2/Az|=0 



|yA2;|=0 



\yAz\=o 



7 



2\z\ 



(1 + j)2\yVz\ 



\|j/Az|=0 
1 , 2 \ l^^l ^ 



z = a; 
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Combining these terms, we have 

J2 V'l < (1 + 7)"((1 - ef + 2e(l - e) ^ + 7)-l-i + 1) + e'+ 

SC[n] x^Q 




Note that (1 + 7)-l^l(7l^l + 1) decreases with |a;| for aU 7 > 0, as docs (1 + 72)1^1(1 + 7)-2kl_ Xo 
complete the proof, we wih show that {(p) has no weight 1 components (i.e. eta; = for |x| < 2). In 
the contribution from Eq. (3), this imphes that only the |a:| > 4 terms contribute (since x = yV z 
and y A z = 0). Therefore, having no weight 1 components would imply that 

J2 7l^ltrV'|<(l+7) 

SC[n] 

which would imply the theorem. Now, for any 0, if, we have 1 — e > |(cos6(0| + e*''' sin^(l|) (g) 
(0|®"-i|V')p. Picking such that 

v/Ko|V)P + Kio"-MV')P' 

and if such that 6**^(10""^ 1-0) > 0, it is easy to see that 

l-e> |cos0{O|?/») + e^'^ sin 0(10"- = KOIV')!^ + 1(10"-^)^ 

However, we have assumed that 1 — e = |(0|'i/')P, so this implies that (10"'-^|V') = 0. Repeating the 
argument for the other n — 1 subsystems shows that is indeed orthogonal to every state with 
Hamming weight at most 1, so |^) has no weight 1 components. □ 



B Proof of Theorem 1: correctness of the product test 

In this appendix, we prove correctness of the product test (Theorem 1). Let the test be defined as 
in Protocol 1. The following lemma from [38] expresses the probability of passing in terms of the 
partial traces of the input states; we include a proof for completeness. 

Lemma 15. Let Ptestip, o") denote the probability that the product test passes when applied to two 
mixed states p,a e B{C^^ (g) • • • (g) C''"). Define Ptest{p) ■= Ptest{p, p)- Then 

Ptestip, cr) = ^ X] t^P-SO-S, 

SQ[n\ 

and in particular 

Ptestip) = ^ XI 

5C[n] 

If d\ = d2 = ■ ■ ■ = dn = d, for some d, then 

P..(p)=(^)"tr(Pf;^pf. 
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Note that we can in fact assume that di = d2 = ■ ■ ■ = dn = d without loss of gcnerahty by 
setting d = max(di, . . . , dn), and embedding each of C"'^, . . . , C"'" into in the natural way. This 
padding operation neither affects the probability of the swap tests passing nor changes the distance 
to the closest product state. 

Proof. Let J-' denote the swap (or flip) operator that exchanges two quantum systems of equal but 
arbitrary dimension, with J^s denoting the operator that exchanges only the qudits in the set S. 
Then we have 

-Ptest(p, 0-) = tr(p a) I j = ^ XI ® = ^ tr p5(75. 

^ ^ SC[n] SC[n] 

The second part then follows from Lemma 13. □ 

We now analyse the probability of the product test passing for general n. We first note that, 
in the special case where n = 2, it is possible to analyse the probability of passing quite tightly. 
The proof of the following result, which is implicit in previous work of Wei and Goldbart [46], is 
essentially immediate from Lemma 15. 

Lemma 16. Let G C^^ ® C^^, where di < d2, be a bipartite pure state with Schmidt coefficients 
> ^/X2 > • • • > a/ Xdi ■ Then 



while 



In particular, 



1 - e := , max |(V'|0i)|</>2)P = Ai. 

1 - e + ^p^ri)^' ^ Ptest{m^\) < 1 - e + e2. 

We are finally ready to prove Theorem 1. The proof is split into two parts, which we formalise 
as separate theorems. The first part holds when e is small, and depends on the results proven in 
Appendix A. The second part holds when e is large, and is proved using the first part. 

Theorem 17. Given |V') e (g) • • • (g) C^", let 

1 - e = maxIKV'l./'i, ■ ■ ■ An)? ■ \(t>i) e&\l<i<n}. 

Then 

1 - 2e + < Ptest{\^){^) < 1 - e + + e^/^. 

Proof. The lower bound holds by general arguments. It is immediate that, if applied to . . . , 
the product test succeeds with probability 1. As the test acts on two copies of 1^/)), which has overlap 
1 — e with 101, . . . , (j)n), it must succeed when applied to IV') with probability at least (1 — e)^. The 
upper bound follows from Lemma 15 and Theorem 14. The statement of Theorem 14 only explicitly 
covers the case where the dimensions of all the subsystems are the same; however, as noted above, 
we can assume this without loss of generality. □ 
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This result is close to optimal. At the low end, the state |^) = \/l — e|0") + y^ll") has 
PteM)m = l-2e + 2e^ + o{l). At the high end, for |V^) = ^/T^lOO) + Ptcst(|^)(^|) = 

1 — e + e^. We also note that this result does not extend to a test for separability of mixed states; 
the maximally mixed state on n qudits is separable but it is easy to verify that Ptesti^/d^) = 
{{d+ l)/2d)^, which approaches zero for large n. 

Theorem 17 only gives a non-trivial upper bound on the probability of passing when e is small 
(up to e = ^{3-V5)^ 0.38). We now show that the product test also works in the case where the 
state under consideration is far from any product state. We will need two lemmas. 

Lemma 18. Given {ip) G (X"- • •(giC"'", let P^^gf^tp) {tpD be the probability that the P-product test - 
the test for being product across partition P - passes. Then, for all P, Pt'estiW {'^\) ^ Ptest{\'^) ■ 

Proof. The subspace corresponding to the usual product test passing is contained within the sub- 
space corresponding to the P-product test passing. □ 

Lemma 19. Let , he pure states such that = 1 — e, and let P he a projector. Then 

mp\i^)-{m4>)\<v^- 

Proof. We can directly calculate ^|| ~ \'t>){(t>\ 111 = V^- This then gives the claimed upper 

bound on | trP(| '(/')('(/' I - |'?!')('?!'|)| (see [39, Chapter 9]). □ 

Theorem 20. Given |V') e (g) • • • (g) C^", let 

1 - e = max{|(^/;|0i,. . . ,(7>„)|^ : |0i) G C''% 1 <i<n}. 
Then, if e > 11/32 > 0.343, Ptesti]^^) {M) < 501/512 < 0.979. 

Proof. For simplicity, in the proof we will use a quadratic upper bound on Ptest(|'0)('0l) that follows 
by elementary methods from Theorem 1: Ptcst(|V')(V'l) ^ 1 ~ |e-|-2e^. For a contradiction, assume 
that Ptest(|V')(V'l) >P-= 501/512, while e > 11/32. 

For any partition P of [n] into 1 < k < n parts, let \(pp) be the product state (with respect to 
P) that maximises \{ip\4>)\'^ over all product states |(/>) (with respect to P). If 

i-h<M^p)\^ <i-e, 

where for readability we define £ := 1/32 and h := 11/32, then by the quadratic bound given 
above the P-product test passes with probability P^^^.{\tl;) (ipl) < p, implying by Lemma 18 that 
-Ftest(|'0)('0l) ^ P- Therefore, as we are assuming that is a counterexample to the present 
theorem, there exists a k such that > 1 — £ for some that is product across k parties, 

and yet |(?/'|?!>)P < 1 — /i for all \4>) that are product across A; -|- 1 parties. 

So, for this k, let l^i) • • • \4>k) be the state that maximises . . . , 0fe)p. Thus there is some 

e' < £ such that we can write as 

W = Vi^\ci>i)---\<Pk) + V^\0, 

and by the arguments at the end of Theorem 14, the i^^ marginal of |^) has support orthogonal to 
\(f)i). Assume without loss of generality that \4>i) is a state of two or more qudits. Now we know 
that 

max \{M,iM,2)\\^-^')<-^-h, (3) 

l?'l,l)'l?'l,2> 
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l"?^! i)!*^! 2)l'/'2) • • • I'/'fc) would be a (/c + l)-partite state with overlap at least 1 — h with | ■(/')• (Here 
we have used the fact that |^) is orthogonal to 2)\4'2) • • • for any choice of 2)-) 

Let 1-6 = max|<^/^^^)_|<^/^^^) K0i|0'i,i)l0'i,2)P- Then Eq. (3) implies that 

, l-h 1-h 21 

137 < 137 = 31- 

Using Lemma 16, we find that Ptest(|V'i)(V'i|) < 1 — S + 5"^ < 751/961. Next we use Lemma 19 to 
obtain 

Ptest(|V')(V'l) < /'testl I </>!)(</>! I ^ " " " ^ lA) {^k\) + ^ 
<Ptest(|'/'l>(</'l|) + V£ 

751 n~ ^ 

But we previously assumed that Ptest(|V')(V'l) > P > 0.978. We have reached a contradiction, so 
the proof is complete. □ 

One might hope that this theorem could be improved to show that, as e — t- 1, Ptcst(|V')(V'l) 
necessarily approaches 0. However, this is not possible. Consider the d x d-dimensional bipartite 
state 1$) = -^^Yfi=\ KO- It is easy to verify using Lemma 16 that Ptest(|^)(*|) = 1/2(1 + 1/d) 
while max|^^)_|^2> l(*l0i)l02)P = 1/c^- 

Combining Theorems 17 and 20, we obtain Theorem 1 and thus have proven correctness of the 
product test. The constants in Theorem 20 have not been optimised as far as possible and could 
be improved somewhat. 



C Optimality of the product test 

Our test has perfect completeness in the sense that if |^) is exactly a product state then it will 
always pass the product test. It is hard to precisely define soundness, since no state is orthogonal 
to all product states: however, we can say that our test has constant soundness in that if has 
overlap at most 1 — e with any product state then it will pass the product test with probability at 
most 1 — 0(e). 

In fact, if we consider only product-state tests with perfect completeness, then we can show 
that our test has optimal soundness: that is, it rejects as often as possible given the constraint 
of always accepting product states. More generally, suppose that a product-state test T is given 
as input. Since the outcome of the test is binary, we can say that T is an operator on the 
nfc-qudit Hilbert space with < T < / and that the test accepts with probability trT^®*^. 

Let S be the set of product states in C^i ® • • • ® C^", and define to be the span of {\<^)®^ : 
|</>) G 5*}. For a single system C^, the span of : |0) G C^} is denoted Sym'' C^. This is 

the symmetric subspace of (C^)®^', meaning that it can be equivalently defined to be the set of 
vectors in (C'^)®'^ that is invariant under permutation by the symmetric group S]^. This fact allows 
the projector onto Sym'^C'^, which we denote Hj^™, to be implemented efficiently [6]. Also, it 

implies that = Sym'^C'^i (g) • • • (g) Sym'^C'^" and that the projector onto , denoted H^fc, is 

Now we return to our discussion of product-state tests. If XtT(\)®^ = 1 for all G 5, then 
T > Hgfc. Thus, T will always accept at least as often as II gk will on any input, or equivalently. 
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taking T = Il^fe yields the test which rejects as often as possible given the constraint of accepting 
every state in S'^. 

To understand H^a:, note that the projector onto Sym*^ is given by ^ YlneSk -^(^)' where 

Pi'^)= XI Ki'---'^fe)(Mi)'---'Mfc)l- 

For k = 1, Sym^ simply equals C^, and H^i is the identity operator on (C'')®'^. Thus, no 
non-trivial product-state test is possible when given one copy of \^p). 

When k = 2, Sym^ is the +1 cigcnspacc of (/ -|- J')/2, which is the space that passes the 
swap test. Thus, the product test (in Protocol 1) performs the projection onto and therefore 
rejects non-product states as often as possible for a test on |V')®^ that always accepts when is a 
product state. These arguments also imply that given projecting onto S'^ yields an optimal 

fe-copy product-state test of The strength of these tests is strictly increasing with k, but we 
leave the problem of analysing them carefully to future work. 

Finally, this interpretation of the product test allows us to consider generalisations to testing 
membership in other sets S. The general prescription for a test that is given k copies of a state is 
simply to project onto the span of '■ \'<P) ^ iS}- However, we will not explore these possibilities 

further in this paper. 



D Proof of correctness of the protocol to put QMA(/c) in QMA(2) 

This section proves several of the claims made in Section 3. First we prove Lemma 3 (which strictly 
subsumes Lemma 2) by showing the validity of Protocol 2. 

Proof. It is obvious that this protocol achieves completeness c: if the Merlins follow the protocol, 
the product test passes with certainty, and hence Arthur accepts with the same probability that A 
accepts, which is at least c. Showing soundness is somewhat more complicated. 

We first show that we can assume that the two states Arthur receives are identical. Imagine 
that this does not hold, and Arthur receives different states \(p),\ip). Then the probability that the 
product test accepts is 



SC[k] 5C[fc] 

^ tr 01 + tr 

5C[fc] 

= ^(^test(0)+i'test(^)), 

where the first inequality is Cauchy-Schwarz and the second is the AM-GM inequality. In the second 
stage, we run A on the states supplied by the first Merlin. However, after passing the product test, 
the resulting state is symmetric under exchange of the two Merlins. Therefore, running A on the 
systems from the first Merlin is equivalent to randomly choosing to run A on the system of either 
the first or the second Merlin. We conclude that the probability that the whole algorithm accepts 
is also upper bounded by the average probability of it accepting when run on and |(^). So, to 
achieve maximal probability of accepting, the two states might as well be identical. 

To prove the remainder of Lemma 3, we will need the following "gentle measurement" lemma. 
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Lemma 21 (Gentle measurement lemma [47, 41]). Let p be a density operator, and let < X < I 
be a projector such that ti pX > 1 — d. Then ^\\p — < s/S. 

Assume that the maximum overlap of |^) with a product state is 1 — e and that the product test 
accepts \if;)^^ with probability 1 — 5. Recall from Theorem 1 that S > -g^e- Next, by Lemma 19, 
A accepts [i/j) with probability < s + ^/e. Thus, Lemma 21 implies that, conditioned on passing 
the product test, we are left with a state that has probability < s + ^/e + s/S of being accepted by 
A. Combining the two tests, we find that the acceptance probability is 

s' < max min(l - 5, s + ^/e + Vd) < 1 - ^ . (4) 

The last inequality is the result of a lengthy, but boring, calculation. (For readers desiring a 
simple self-contained proof, we point out that the k = 2 case is particularly simple, and admits 

the stronger bound s' < (3 + s)/4. The proof is banished to a footnote^. Moreover, the result that 
QMA„i(2)s c ^ QMA2^^(2)3+s ^, together with Lemma 4 and the results of [32, 2] can be used to 
obtain an alternate proof of Theorem 6.) 

As a result of Eq. (4) , a /c-prover soundness-s protocol can be simulated by a 2-prover protocol 
with soundness s'. If k < poly(n), then the messages will still have a polynomial number of qubits. 

Finally, we observe that the resulting protocol is separable (as pointed out in Section 3), and 
so we have in fact a protocol that meets the definition of QMAf^{2)s'^c- D 

Next, we prove Lemma 4. 

Proof. In the original protocol, Arthur performs a measurement M on k states, each comprising m 
qubits. If the input is a YES instance, then there exists a product state on which M has expectation 
value > c, whereas if the input is a NO instance, then for all product states, M has expectation 
value < s. 

For the modified protocol, each of the k provers submits im qubits, and Arthur's measurement 
is M®^ . If the input is a YES instance, then the provers can submit I copies of the optimal input 
to the original protocol. This state is still product (across the k provers) and has probability > 
of being accepted. 

The more interesting case is when the input is a NO instance. In general, the provers can submit 
states that are entangled across the i different parts of their message. However, there cannot be 
any entanglement between the k different provers. 

For the new protocol, we can imagine Arthur sequentially performing the measurement {M, / — 
M} I times and accepting only if the outcome is M each time. Since the input is a NO instance, if 
this measurement is applied to a state, pure or mixed, that is separable across the k parties, then 

^Let the Schmidt decomposition of \'4)) be Y.^ ® I*)- K IV'') := ■ i+£^£fi 1^)^1-42 1^^^232^ ^here 

F is the swap operator, then the probabihty of passing the swap test is and the probabihty of passing the 

entire protocol is {ip'\M^'^^^\tp') . Now we calculate 

(V^lM^i^^lV-') = ({V'W>+v^(V'|M|^,^>+^/A;(^,^|M|V')) +i^A^^^^ < 
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outcome M will occur with probability < s. Therefore the probability that all £ measurements have 
outcome M will be < as long as conditioning on outcome M does not induce any entanglement 
in the unmeasured states. We need not consider the state that remains after outcome / — M, since 
Arthur rejects immediately when this outcome occurs. 

The final step of the proof is to show that applying the measurement {M, / — M} to the first 
system of a multipartite product state and obtaining outcome M will not create any entanglement 
across the k provers. Suppose that the i^^ Merlin supplies the £m-qubit state , where 

each is an m-qubit system. The original measurement M is separable, so can be written as 

for Qj > and \c(^j^) unit vectors on m qubits. This measurement is applied sequentially to p^^'"'^ ^ 
then P^''"'^ , and so on until Pj''"'^ . (Here we write ph'"'^ as shorthand for Pj ■ ■ ■ ■) When the 
measurement is applied to p^' ' '^ and the outcome is M, the residual state is proportional to 



= E (2) («r i"'^ ® ^''^ ^ iv'^) (</'(^) r^--^ \afY^ ® /^--.^ , 

j i=l 

which is separable across the P^ £ '■ P2 e ' ' ' ' ' ^2 t Therefore, by induction, the state 
always remain separable as long as outcome M always occurs. □ 

For the reader's convenience, we briefly summarise the proof of Lemma 5 here. 

Proof. The idea is to repeat the basic protocol £ times, and accept if there are > "accept" 
outcomes or reject otherwise. For YES instances, the provers can send i copies of the same proofs, 
each of which will be accepted with probability > c. Then a Chernoff bound yields that the 
completeness is > 1 — exp(— ^(c — s)^/2). For NO instances, each of the i copies has probability 
< s of being accepted. Since the provers may submit entangled states, we can no longer guarantee 
that these events are independent, but still Markov's inequality implies that the probability of 
> ^£ accept outcomes is < ^ = < 1 — This last step uses the fact that 

l/(l + a;/2)< l-a;/3, forO<a;< 1. □ 

We now complete the proof of Theorem 6. 

Proof. For the case of perfect completeness, we use Lemma 3 to replace the original fc-prover 
protocol with a 2-prover protocol that still has perfect completeness, still has 1 — l/q{n) soundness 
(where q{n) is still a polynomial of n) and now has a separable measurement performed by the 
verifier. Now we simply repeat (using Lemma 4) q{n) ■ p{n) times, and obtain soundness < (1 — 

l/g(n))9Hp(") < e"P("). 

When we merely have c — s > l/g(n), for some polynomial q{n), then we first have to apply 
Lemma 5 with i = p{n) + log(243g(n)^p(n)) to replace the soundness with 1 — l/3g(n) and the 
completeness with 1 — 2ilq{j^^p{n) • -l^^^t, we apply Lemma 3 to leave the completeness the same, 
reduce the number of provers to 2, guarantee the measurement is in SEP and replace the soundness 
with 1 — l/243g(n)^. Finally, we repeat 243g(ra)^p(ra) times and obtain soundness exp(— p(n)) and 
completeness 1 — exp(—p(n)). □ 
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E QMA(2) protocols and the maximum output oo-norm 



In this section, we elaborate on the connection between the maximum acceptance probabihty of a 
QMA(2) protocol and the quantity ^^"(AA) (or equivalently ||AA||i^oo) of a quantum channel M. 
This connection has been known for some time as folklore and has appeared before in Ref. [36]. 

The idea is seen most simply when the verifier performs a projective measurement M on the 
witness p € SEP((ii,(i2). In this case, we can define N to be the channel from suppM to B{di) 
(the space of di x di density matrices) that is obtained by embedding supp M into C^i ® C^^ and 
then tracing out the second system. The operator M is related to the Stinespring embedding [43] 
of J\f. Now ||A^||i->oo is the largest eigenvalue of the least mixed state in the output of A/", which 
is equivalently the least entangled state in suppM, which in turn is the maximum acceptance 
possibility of the protocol. In case M isn't a projector, then the corresponding M is now trace 
non-increasing instead of trace preserving, but the same argument still goes through. We formalise 
the correspondence as follows: 

Proposition 22. If M is a trace non-increasing channel with output space B{di) and d2 Kraus 
operators, then there exists an operator M G M{di,d2) satisfying < M < I such that 



Similarly, for any such operator M, there exists a corresponding channel M satisfying (5). 

Proof. Given M, we define M as the channel from B(rankM) to B{di) defined by M{X) = 
tT2{M^/^XM^/^). In what follows, if F is a vector space, define B{V) to be the set of unit 
vectors in V. 



max \\tT2M^/'^ibM^/^\\oc 

|V>eB(suppM) 

= max max tr(/di 'Si r])M^/'^ibM^/'^ 

|V>6S(suppM) |r?) £5(^*2) 

= max max max tr(/3 (g) r))M^^^ibM''-^^ [(Id-, S r))M^^'^il;M''-/^ has rank 

|V>eB(suppM) \r))eB{C<i2) |/3)eB(C'*i) 

= max max tr M^/^(/3 ® ?7)M^/^ [M^/^(^ (g) r/)M^/^ has rank 

|»7)eB(C<*2) |/3)eB(C''i) 

= max max tr M(/3 (8) rj) = max tr Mp. 

|jj>eB(C<*2) \i3)eB{C''i) peSEP(di,(i2) 

These arguments can also be run backwards. In this case, we are given a channel M and need only 
to find an M such that M{X) = tr2(MV2xMV2). if the Kraus form of AT is M{X) = X;*=i NiXNj 
(with NjNi < I) then this is achieved by taking M to be Yli=i ^^i^j S \i){i\- □ 

As a result of Proposition 22, the question of whether QMA(2) protocols can be amplified to 
exponentially small error is directly related to the question of multiplicativity of the maximum out- 
put infinity norm (equivalently, additivity of the minimum output min-entropy) . Indeed, additivity 
violations for 5'™" (e.g. [28, 27, 23]) translate directly into QMA(2) protocols for which perfect 
parallel repetition fails^. Conversely, [36] observed that QMA(2) protocols obey perfect parallel 
repetition when the corresponding channel Af is known to have additive S^'^, for example when 
J\f is entanglement breaking. 

^Note that, taking the standard definition of QMA(2), this is strictly speaking only true if the corresponding 
QMA(2) protocol can be implemented in polynomial time. 




(5) 
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F Proof of correctness of the product unitary test 



This appendix is devoted to the proof of Theorem 12. In order to analyse the product unitary test 
in Protocol 3, we will need to relate the maximum overlap of an n-qudit unitary with a product 
operator to the maximum overlap of that unitary with a product unitary. 

Lemma 23. Given U G U{<F-), let 

1 - e = max{|(C/, ®---® An)\^ : A, e M(d), {Ai,Ai) = 1, 1 < i < n}. 

Then, if e < 1/2, there exist Vi, ... ,Vn G U{d) such that \{U,Vi ■ ■ ■ Vn)f > {I - 2ef . 

Proof. For all 1 < z < n, let the polar decomposition of Ai be |Aj|Ci, where \Ai\ = ^ A^Aj and 
d e U{d). Set A = Ai,C = Ci. Then 

/C,A) = —Y\trCj\Ai\Ci = —Y\tT\AA = — max \tTVA\> VT^. 

1=1 1=1 
This implies that we can expand 

U = VT^A + D, C = Vl- e' A + E 

for some e' < e and matrices D, E such that {D, D) = e, {E, E) = e', {A, D) = 0, {A, E) = 0. So 

\{U,C)\ = \^/T^e^/l^+{D,E)\ > \^/l^e^/T^ - > 1 - 2e, 

for e < 1/2. This implies the lemma. □ 

We are now ready to prove correctness of the product unitary test. 

Proof of Theorem 12. By the Choi-Jamiolkowski isomorphism, there is a direct correspondence 
between operators M G M{d) with |(M, M)| = 1 and normalised quantum states \v{M)). If we 
define 

1 - e' := max{|(i7, Ai ® ■ ■ • (g) : A^ G M{d), (A, A) = 1,1 < « < n}, 

then by Theorem 1, if e' < 0.0265, Ptest(t^) < 1 - e' + e'^ + and if e' > 0.0265, Ptest(t^) < 
501/512. If e' > 1/2, then the result follows immediately. On the other hand, by Lemma 23, 
if e' < 1/2, there exist Vi,...,Vn G U{d) such that \{U,Vi (g> ■ ■ ■ (S> K)p > (1 - 2e'f > 1 - 4e'. 
Thus we have \e < e' < e. The theorem follows by combining the bound on e and the bound on 

PtestiU). □ 



G Interpretation of the product test as an average over product 
states 

Wc have seen (via Lemma 15) that the probability of the product test passing when applied to 
some state IV') G {C^ f is equal to the average purity, across all choices of subsystem SC. [n], 
of tr \iIj){iI)\s. One interpretation of the proof of correctness of the product test is therefore that, 
if the average entanglement of \tp) across all bipartite partitions of [n] is low, as measured by the 
purity, then lip) must in fact be close to a product state across all subsystems. 

In this appendix, we discuss a similar interpretation of our results in terms of an average over 
product states, via the following proposition. 
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Proposition 24. Given \tp) e (C*)®", 



Proof. Similarly to before, let the input to the product test be two copies tpA, ipB oi a state 
ijj := and let T denote the swap operator that exchanges systems A and B. Then 

E|<^i>,...,|<^„> [KV'I'/>1, • • • , 0n)|^] = M^A ^ V'i?)((</'1 ® • • • ® K)a ® (</>! ® • • • ® 0n)s)] 

= tr(V'A®V'B)(E|^)[<^A^0B])®" 

I -\~ J- \ ®" / 2 ^ " 



□ 



We note that, in this interpretation, our main result is reminiscent of the so-called inverse 

theorem for the second Gowers uniformity norm [20, 21], which we briefly outline. Let / : {0, 1}"^ — t- 
M be some function such that ^ fix)"^ = and let the p-norms of / on the Fourier side be 

defined as = \ y2xe{o,i}" 2^J2ye{o,i}"{'~^T^fiy) ) ■ Then it is straightforward to show 

that 11/11^ < II/II4 ^ ll/lloo) where the quantity in the middle is known as the (fourth power of) 
the second Gowers uniformity norm of /. That is, ||/||^ (representing the largest overlap of / with 
a parity function) is well approximated by WfWf (the average of the squared overlaps with parity 
functions). This simple approximation has proven useful in arithmetic combinatorics [20]. 

Via the correspondence of Proposition 24, Theorem 1 shows that a similar result holds if we 
replace the cube {0, 1}" with the space (C^)®": the largest overlap with a product state can be 
well approximated by the average squared overlap with product states. Note that if one attempts 
to use the classical proof technique for the Gowers uniformity norm to prove this result, one does 
not obtain Theorem 1, but a considerably weaker result containing a term exponentially large in n. 
Intuitively, this is because the set of overlaps with parity functions for some function / : {0, 1}" — > M 
is essentially arbitrary, whereas the set of overlaps of some state [V') with product states is highly 
constrained. 
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